In this Power-bi project, we do an analysis of a Data Professional Survey that was done and try to draw some insights from these data. The survey was taken from a total of 630 respondents targeting specifically those working in the data field. Here is a step to step walk through of the project.
The first step was loading data into Power-bi. Power-bi accepts data from multiple data sources. Our data from the survey was stored in an Excel Workbook and so we use that option to load data into Power-bi. Instead of loading the data directly into Power-bi, we transform it to open the data in Power-Query for Data Cleaning.
After loading the data into Power-Query, I then cleaned the data to make it more usable. This involved:
We start our analysis by finding out how many people took the survey. We find there were a total of 630 survey takers. This is a good sample size to continue with our analysis and draw some meaningful insights that could actually be used for decision making by various stakeholders eg an employer or a person looking to enter the data field.
It was also helpful to know the average age of respondents. This can help us better understand some of the choices or answers the respondents gave among other reasons. The average age of survey takers was 29.87 years.
The next stop was to determine which career or job title in the Data Field has the highest salary. We have 5 main career paths in the Data Field; Data Scientist, Data Analyst, Data Engineer, Data Architect and Database Developer, and try to compare their salaries.
Doing this analysis, we find out that Data Scientist are paid the most averaging 94K USD while the least paid were the Database Developers averaging 33K USD. It is important to mention however, of the survery takers, most were Data Analyst and might be the more accurate representation of the real world average salary of Data analyst.
We then analyse the data to find out the country of Respondents. This will help us to filter down our analysis by country as some metrics are country based eg Salary.
We then tried to find out whether the people working in the data field are happy with their work/life balance. This can help someone make a decision on whether to join the field if this is something that's important for them.
We then tried to find out whether people are happy with the salaries they recieve on a scale of 10.
Finally, an important metric that various stakeholders can find useful was finding out the average salary by sex. This can help someone know whether there's any gender bias in the field. The analysis revealed little difference in the wage gap between the two genders with women earning slightly higher than their male counterparts in the sample taken.
Finally we take all the visualizations created and combine them into a dashboard. Here we can filter based on various metrics like country where the respondents live, job title etc and get more personalized analysis based on the insights you need from the data.
To download and view the full project on GitHub, click here.